Comparing Czech and Russian valency on the material of VALLEX
Abstract
In this study we compare Czech and Russian valency frames using monolingual and bilingual data. We assume that Czech and Russian are close enough that the majority of their verbs have similar valency structures. We use VALLEX as the source of valency frames and a Czech-Russian dictionary to translate Czech verbs into Russian automatically. We then manually check whether each Czech frame fits the Russian verb; where the frames differ, we add the verb to the set described in the paper. We suggest that, for some verbs, the semantic class is connected to the type of difference between their Czech and Russian valency frames.
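The procedure in the abstract (project each Czech frame onto the dictionary translations of its verb, then keep the pairs where a manual check finds a different Russian frame) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the `Frame` class, the function names, the simplified slot notation, and the single toy entry are all assumptions made for this example and are not taken from VALLEX or from the dictionary used in the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    """Simplified valency frame: a tuple of (functor, surface form) slots.

    Hypothetical notation for illustration; real VALLEX frames are richer.
    """
    slots: tuple


# Toy data (assumed, not real VALLEX or dictionary entries).
czech_frames = {
    "děkovat": Frame((("ACT", "nom"), ("ADDR", "dat"), ("PAT", "za+acc"))),
}
cz_ru_dictionary = {"děkovat": ["благодарить"]}


def project_frames(frames, dictionary):
    """Pair every Czech frame with each Russian translation of its verb."""
    candidates = []
    for cz_verb, frame in frames.items():
        for ru_verb in dictionary.get(cz_verb, []):
            candidates.append((cz_verb, ru_verb, frame))
    return candidates


def differing_pairs(candidates, judgements):
    """Keep the pairs whose manually assigned Russian frame differs.

    `judgements` maps (cz_verb, ru_verb) to the frame an annotator
    accepted for the Russian verb; unjudged pairs are skipped.
    """
    result = []
    for cz_verb, ru_verb, cz_frame in candidates:
        ru_frame = judgements.get((cz_verb, ru_verb))
        if ru_frame is not None and ru_frame != cz_frame:
            result.append((cz_verb, ru_verb, cz_frame, ru_frame))
    return result


candidates = project_frames(czech_frames, cz_ru_dictionary)

# Stand-in for the manual check: the annotator records the Russian frame.
manual_judgements = {
    ("děkovat", "благодарить"): Frame(
        (("ACT", "nom"), ("ADDR", "acc"), ("PAT", "за+acc"))
    ),
}
to_describe = differing_pairs(candidates, manual_judgements)
```

In this toy case the addressee is dative in Czech but accusative in Russian, so the pair ends up in `to_describe`. The manual check is modelled as a plain lookup table because the abstract describes that step as human work rather than an automatic comparison.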
Related resources
Transformation of WordNet Czech Valency Frames into Augmented VALLEX-1.0 Format
The paper presents details and a comparison of two valuable language resources for Czech: two independent electronic dictionaries of verb valency frames. The FIMU verb valency frames dictionary was designed during the EuroWordNet project and contains semantic roles and links to the Czech wordnet semantic network. The VALLEX 1.0 format is based on the formalism of the Functional Generative Descriptio...
Valency Lexicon of Czech Verbs: Towards Formal Description of Valency and Its Modeling in an Electronic Language Resource
Valency refers to the capacity of a verb (or a word belonging to another part of speech) to take a specific number and type of syntactically dependent language units. Valency information is thus related to particular lexemes, and as such it is necessary to describe valency characteristics for separate lexemes in the form of lexicon entries. A valency lexicon is indispensable for any complex Natura...
Valency Frames Of Czech Verbs In VALLEX 1.0
The Valency Lexicon of Czech Verbs, Version 1.0 (VALLEX 1.0) is a collection of linguistically annotated data and documentation, resulting from an attempt at formal description of valency frames of Czech verbs. VALLEX 1.0 is closely related to Prague Dependency Treebank. In this paper, the context in which VALLEX came into existence is briefly outlined, and also three similar projects for Engli...
Valency Lexicon of Czech Verbs VALLEX: Recent Experiments with Frame Disambiguation
VALLEX is a linguistically annotated lexicon aiming at a description of syntactic information which is supposed to be useful for NLP. The lexicon contains roughly 2500 manually annotated Czech verbs with over 6000 valency frames (summer 2005). In this paper we introduce VALLEX and describe an experiment where VALLEX frames were assigned to 10,000 corpus instances of 100 Czech verbs – the pairwi...
Automatic Valency Derivation for Related Languages
This paper describes an experiment combining several existing data resources (parallel corpora, valency lexicon, morphological taggers, bilingual dictionary etc.) and exploiting them in a task of building a valency lexicon for a related language (Russian) derived from a high quality manually created valency lexicon for Czech (Vallex) containing several thousands of verbs with very rich syntacti...